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5.3.2 Bivariate Normal Distribution 

Remember that the normal distribution is very important in probability theory and it shows up in many different 
applications. We have discussed a single normal random variable previously; we will now talk about two or more normal 
random variables. We recently saw in Theorem 5.2 that the sum of two independent normal random variables is also 
normal. However, if the two normal random variables are not independent, then their sum is not necessarily normal. Here 
is a simple counterexample: 

Example 5.35 

Let X ~ iV(0, 1) and W ~ Bernoulli (|-) be independent random variables. Define the random variable l^asa 
function of X and W : 


Y = h(X, W ) 



Find the PDF of Y and X + Y. 


Solution 

o Note that by symmetry of N (0, 1) around zero, —X is also N (0, 1). In particular, we can write 

Fr(y) = P{Y < y) 

= P(Y < y\W = 0 )P(W = 0) + P(Y <y\W= 1 )P(W = 1) 

— i P(X < y\W = 0) + \P{~X < y\W = 1) 

— —P{X < y) + —P(—X < y) (since X and W are independent) 

£ At 

— — $(y) H — <&(y) (since X and —X are iV(0,1)) 

A A 

= $(y). 

Thus, Y N( 0,1) . Now, note that 


r 2x 

with probability j 

Z = X + Y= 1 


lo 

with probability j 

dom variable and its PDF is given by 

fz(z) = + i 

(PDF of 2X at z), 

fz(z) = 16(z) + i 

(PDF of a N(0, 4) at z) 

1 1 

z - 2 

“ 2 S{Z) + 4V2? e ’ ' 


In particular, note that X and Y are both normal but their sum is not. Now, we are ready to define bivariate 
normal or jointly normal random variables. 
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Definition 5.3 

Two random variables X and Y are said to be bivariate normal, or jointly normal, if aX + bY has a normal distribution 
for all a, b G R. 

In the above definition, if we let a — 6 = 0, then aX + bY — 0. We agree that the constant zero is a normal random 
variable with mean and variance 0. From the above definition, we can immediately conclude the following facts: 

- If X and Y are bivariate normal, then by letting a — 1, b — 0, we conclude X must be normal. 

- If X and Y are bivariate normal, then by letting a — 0, b — 1, we conclude Y must be normal. 

- if X ~ N {px > a x ) and Y ~ iV {fly , 0y) are independent, then they are jointly normal (Theorem 5.2). 

-ifX r^j N{fi x ,a 2 x ) and Y N{fly, Cy) are jointly normal, then 

X + Y ~ N{fJ, x + fly, <J 2 X + ay + 2p(x, Y)a x ay) (Equation 5.21). 

But how can we obtain the joint normal PDF in general? Can we provide a simple way to generate jointly normal random 
variables? The basic idea is that we can start from several independent random variables and by considering their linear 
combinations, we can obtain bivariate normal random variables. Similar to our discussion on normal random variables, we 
start by introducing the standard bivariate normal distribution and then obtain the general case from the standard one. The 
following example gives the idea. 

Example 5.36 

Let Zi and Z 2 be two independent iV(0,1) random variables. Define 

x=z u 

Y — pZl + yjl — p 2 Z 2 , 


where p is a real number in ( — 1, 1). 

a. Show that X and Y are bivariate normal. 

b. Find the joint PDF of X and Y. 

c. Find p{X, Y). 

• Solution 

o First, note that since Z\ and Z 2 are normal and independent, they are jointly normal, with the joint PDF 

fz 1 z 2 (zi,z 2 ) = fz,{zi)fz 2 {z 2 ) 



a. We need to show aX + bY is normal for all a, b G R. We have 

aX + bY — aZ\ + b{pZ\ + y^l — p 2 Z 2 ) 

— {ct + bp)Z\ + b^Jl — p 2 Z 2 , 

which is a linear combination of Z\ and Z 2 and thus it is normal. 

b. We can use the method of transformations (Theorem 5.1) to find the joint PDF of X and Y . The 
inverse transformation is given by 

Z 1 =X = h 1 {X,Y), 

^ = -7=r*+7iV = ft2(x ’ y) ' 


We have 
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fxv(zi,Z2) = fz 1 z a (hi(x,y),h 2 (x,y))\J\ 

= fZiZ 2 ( x i - 


V 1 -p 2 V 1 - p‘ 


rVM , 


where 



’ dh\ 

dh\ 



dx 

dy 


J — det 

dh. 2 

dh 2 

— det 


dx 

dy . 





V l ~p 2 


Thus, we conclude that 


fxy( x ,y) — fz,z 2 (x, - 


V 1 - p 2 V 1 - p‘ 


-y)\J\ 


= ^ exp 


+ t — w(-P x + y? 


i - p 


\/!-P 2 


277^/1 - p 2 

. To find p(X, y), first note 


exp 


2(1 -P 2 ) 


x 2 -2pxy + y' 


Var(X) = Var(Zi) = 1 , 

Var{Y) = pVar(Zi) + (1 - p 2 )Var(Z 2 ) = 1. 

Therefore, 

P (x,y) - Cov(x,y) 

= Cov{Zi,pZi + •y/l — P 2 z 2 ) 

= pCov(Z 1 , Z x ) + — p 2 Cov(Z 1 , z 2 ) 

= p • i + y^i — p 2 • o 
= p- 


We call the above joint distribution for JT and Y" the standard bivariate normal distribution with correlation coefficient p. 
It is the distribution for two jointly normal random variables when their variances are equal to one and their correlation 
coefficient is p. 

Two random variables X and Y are said to have the standard bivariate normal distribution with correlation coefficient p 
if their joint PDF is given by 


!xy{x ' y) = ~ 2w+!/2!} ' 
where p £ ( — 1, 1). If p — 0, then we just say X and Y have the standard bivariate normal distribution. 

Now, if you want two jointly normal random variables X and Y such that X ~ N ( px > a X )> Y ~ N{hy,(t\), 
and p(X, Y) — p, you can start with two independent iV(0, 1) random variables, Z\ and Z%, and define 
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j X — crxZi + px 

l Y = a Y (pZ 1 + + p Y (5.23) 

We can find the joint PDF of X and Y as above. While the joint PDF has a big formula, we usually do not need to use the 
formula itself. Instead, we usually work with properties of jointly normal random variables such as their mean, variance, 
and covariance. 


Definitions 5.3 and 5.4 are equivalent in the sense that, if X and Y are jointly normal based on one definition, they are 
jointly normal based on the other definition, too. The proof of their equivalence can be concluded from Problem 10 in 
Section 6.1.6. In that problem, we show that the two definitions result in the same moment generating functions. 


Definition 5.4 

Two random variables X and Y are said to have a bivariate normal distribution with parameters px, a X’ PY, 
a y, and p, if their joint PDF is given by 


fxr(x,y) 


1 

2Traxo- Y \/l ~ p 2 


exp 


1 

2(1 -P 2 ) 


f x - p x 

V a x 


2 


( v~py 
V a Y 



Px)(y~PY) 

cr X&Y 


(5.24) 


where px, PY €= R, &x, cry > 0 and p £ ( — 1, 1) are all constants. 


In the above discussion, we introduced bivariate normal distributions by starting from independent normal random 
variables, Z\ and Z 2 - Another approach would have been to define the bivariate normal distribution using the joint PDF. 
The two definitions are equivalent mathematically. In particular, we can state the following theorem. 


Theorem 5.3 

Let X and Y be two bivariate normal random variables, i.e., their joint PDF is given by Equation 5.24. Then there exist 
independent standard normal random variables Z\ and Zi such that 

f X — crxZi + px 

(F = <Jy(pZi + y/1 — p 2 Z 2 ) + p Y 


Proof. (Sketch) To prove the theorem, define 

X-Px 
<rx 

_ P 1 Y—fly 

y/T -p 2 y/l-p? cry 

Now find the joint PDF of Z\ and Zi using the method of transformations (Theorem 5.1), similar to what we did above. 
You will find out that Z\ and Zi are independent and standard normal and by definition satisfy the equations of Theorem 
5.3. The reason we started our discussion on bivariate normal random variables from Z\ and Z 2 is three fold. First, it is 
more convenient and insightful than the joint PDF formula. Second, sometimes the construction using Z\ and Z 2 can be 
used to solve problems regarding bivariate normal distributions. Third, this method gives us a way to generate samples 
from the bivariate normal distribution using a computer program. Since most computing packages have a built-in 
command for independent normal random variable generation, we can simply use this command to generate bivariate 
normal variables using Equation 5.23. 

Example 5.37 

Let X and Y be jointly normal random variables with parameters px, cr|-, py , <Jy > an d P- Fi nc * the conditional 
distribution of Y given X — X. 
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Solution 

o One way to solve this problem is by using the joint PDF formula (Equation 5.24). In particular, since 

X~N(p x ,a 2 x )p 


, we can use 


fY\x(y\ x ) — 


fxr(x,y) 

fx(x) 


Another way to solve this problem is to use Theorem 5.3. We can write 

X — a x Zi + p x 
Y — cr Y (pZi + y/l — p 2 Z 2 ) + Hy 


Thus, given X — X,we have 


and 


^ X~ PX 

Z\ — -, 

ax 


X — Ux / o 

Y — ayp - \- ay\ 1 — p Z 2 + py ■ 

a x v 

Since Z\ and Z 2 are independent, knowing Z\ does not provide any information on Zi - We have shown 
that given X = X,Y is a linear function of Z 2 , thus it is normal. In particular 


E[Y\X = x\ = a Y p X ^ X +a Y Jl- p 2 E[Z 2 


ax 


Py 


— py + pay- 


x - px 
ax 


Var(Y\X = x) = a 2 ( 1 - p z )Var(Z 2 ) 

= (1 ~P 2 )a 2 y 

We conclude that given X — X, Y is normally distributed with mean py+ pay 

(1 -P 2 )a 2 y 


ax 


and variance 


Theorem 5.4 

Suppose X and Y are jointly normal random variables with parameters px, a x , py , CTy, and p. Then, given X — X, 
Y is normally distributed with 


E[Y\X — x\ — py + pay-—— 

ax 

Var(Y\X = x) - (l-p 2 )crf. 


Example 5.38 

Let X and Y be jointly normal random variables with parameters px 



l,a x ~ 1, py ~ 0,(Ty = 4, and 


a. Find P(2X + Y < 3). 

b. Find Cov(X + Y, 2X - Y). 
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c. Find P(Y >1\X = 2). 

• Solution 

o a. Since X and Y are jointly normal, the random variable V — 2X + Y is normal. We have 

EV = 2 EX + EY = 2, 

Var(V ) = 4 Var(X) + Var(Y ) + 4Cov(X, Y) 

— 4 + 4 + 4crxcryp(X, y) 

1 

= 8 + 4xlx2x - 
2 

= 12 . 

Thus, V ~ N (2, 12). Therefore, 

b. Note that Cov(X, Y) — ax<XYp(X,Y) — 1. We have 

Cov(X + Y, 2X - Y) = 2Cov(X, X) - Cov(X, Y) + 2Cov(F, X) - Cov(F, Y) 

= 2- l + 2- 4 = -l. 

c. Using Theorem 5.4, we conclude that given X = 2, 3^ is normally distributed with 

E[Y\X = 2] = HY + pg-y 2 ~ = 1 

crx 

Far(F|X = x) = (1 - p 2 )c4 = 3. 

Thus 

p ( r > i|x = 2 ) = i - $ (XI) = 1. 


Remember that if two random variables X and Y are independent, then they are uncorrelated, i.e., Cov(X, Y) = 0 . 
However, the converse is not true in general. In the case of jointly normal random variables, the converse is true. Thus, for 
jointly normal random variables, being independent and being uncorrelated are equivalent. 

Theorem 5.5 

If X and Y are bivariate normal and uncorrelated, then they are independent. 

Proof. Since X and Y are uncorrelated, we have p(X, Y) — 0. By Theorem 5.4, given X ~ X,Y is normally 
distributed with 


E[Y\X — x] — py + pcry- — — = py, 

crx 

Var(Y\X — x) = (1 — p 2 )cry = <y\. 

Thus, fy\x{y\ x ) ~ /y(y) for all X,y G M. Thus X and Y are independent. Another way to prove the theorem is to 
let p = 0 in Equation 5.24 and observe that fxY{ x iV ) = /x(*)/y(y)- 
<— previous 
next —> 
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